Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 13213641 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.1 GiB |
| Average record size in memory | 88.0 B |
Variable types
| Numeric | 11 |
|---|
LongitudAcc is highly correlated with Fuel Rate and 2 other fields | High correlation |
EngineSpeed is highly correlated with EngineAirInletPressure and 1 other fields | High correlation |
Fuel Rate is highly correlated with Engine Load and 1 other fields | High correlation |
Engine Load is highly correlated with Boost Pressure and 2 other fields | High correlation |
Boost Pressure is highly correlated with Engine Load and 2 other fields | High correlation |
EngineAirInletPressure is highly correlated with EngineSpeed and 3 other fields | High correlation |
AcceleratorPedalPos is highly correlated with Engine Load and 3 other fields | High correlation |
VehicleSpeed is highly correlated with EngineSpeed | High correlation |
BrakePedalPos is highly correlated with AcceleratorPedalPos | High correlation |
Fuel Rate is highly skewed (γ1 = 46.75265747) | Skewed |
Timestamp has unique values | Unique |
LongitudAcc has 3062406 (23.2%) zeros | Zeros |
EngineSpeed has 226548 (1.7%) zeros | Zeros |
Fuel Rate has 3065605 (23.2%) zeros | Zeros |
Engine Load has 3079422 (23.3%) zeros | Zeros |
Boost Pressure has 550952 (4.2%) zeros | Zeros |
AcceleratorPedalPos has 5299357 (40.1%) zeros | Zeros |
VehicleSpeed has 1889729 (14.3%) zeros | Zeros |
BrakePedalPos has 10745857 (81.3%) zeros | Zeros |
Reproduction
| Analysis started | 2022-11-23 15:22:07.425704 |
|---|---|
| Analysis finished | 2022-11-23 15:37:08.037861 |
| Duration | 15 minutes and 0.61 seconds |
| Software version | pandas-profiling v3.4.0 |
| Download configuration | config.json |
| Distinct | 13213641 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.821576415 × 1010 |
| Minimum | 1.717422031 × 1010 |
|---|---|
| Maximum | 8.030538652 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 1.717422031 × 1010 |
|---|---|
| 5-th percentile | 1.927590015 × 1010 |
| Q1 | 2.801417485 × 1010 |
| median | 5.497760062 × 1010 |
| Q3 | 6.535583896 × 1010 |
| 95-th percentile | 7.74541506 × 1010 |
| Maximum | 8.030538652 × 1010 |
| Range | 6.313116622 × 1010 |
| Interquartile range (IQR) | 3.734166411 × 1010 |
Descriptive statistics
| Standard deviation | 2.025682083 × 1010 |
|---|---|
| Coefficient of variation (CV) | 0.4201285862 |
| Kurtosis | -1.530225936 |
| Mean | 4.821576415 × 1010 |
| Median Absolute Deviation (MAD) | 1.971862689 × 1010 |
| Skewness | -0.04471569004 |
| Sum | 6.37105798 × 1017 |
| Variance | 4.1033879 × 1020 |
| Monotonicity | Strictly increasing |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 1.717422031 × 1010 | 1 | < 0.1% |
| 6.164821529 × 1010 | 1 | < 0.1% |
| 6.164820448 × 1010 | 1 | < 0.1% |
| 6.164820559 × 1010 | 1 | < 0.1% |
| 6.164820738 × 1010 | 1 | < 0.1% |
| 6.164820848 × 1010 | 1 | < 0.1% |
| 6.164820928 × 1010 | 1 | < 0.1% |
| 6.164821044 × 1010 | 1 | < 0.1% |
| 6.164821157 × 1010 | 1 | < 0.1% |
| 6.164821229 × 1010 | 1 | < 0.1% |
| Other values (13213631) | 13213631 |
| Value | Count | Frequency (%) |
| 1.717422031 × 1010 | 1 | |
| 1.717422136 × 1010 | 1 | |
| 1.717422332 × 1010 | 1 | |
| 1.71742241 × 1010 | 1 | |
| 1.717422526 × 1010 | 1 | |
| 1.717422632 × 1010 | 1 | |
| 1.717422828 × 1010 | 1 | |
| 1.717422932 × 1010 | 1 | |
| 1.717423034 × 1010 | 1 | |
| 1.717423121 × 1010 | 1 |
| Value | Count | Frequency (%) |
| 8.030538652 × 1010 | 1 | |
| 8.030538545 × 1010 | 1 | |
| 8.030538460 × 1010 | 1 | |
| 8.030538353 × 1010 | 1 | |
| 8.030538253 × 1010 | 1 | |
| 8.030538161 × 1010 | 1 | |
| 8.030538053 × 1010 | 1 | |
| 8.030537945 × 1010 | 1 | |
| 8.030537847 × 1010 | 1 | |
| 8.030537758 × 1010 | 1 |
WetTankAirPressure
Real number (ℝ≥0)
| Distinct | 218 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.04196288 |
| Minimum | 0 |
|---|---|
| Maximum | 14.96215 |
| Zeros | 270 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 9.9288 |
| Q1 | 10.68725 |
| median | 11.23885 |
| Q3 | 11.5836 |
| 95-th percentile | 11.9973 |
| Maximum | 14.96215 |
| Range | 14.96215 |
| Interquartile range (IQR) | 0.89635 |
Descriptive statistics
| Standard deviation | 1.06515459 |
|---|---|
| Coefficient of variation (CV) | 0.09646424297 |
| Kurtosis | 38.23521631 |
| Mean | 11.04196288 |
| Median Absolute Deviation (MAD) | 0.4137 |
| Skewness | -4.758264963 |
| Sum | 145904533.5 |
| Variance | 1.134554301 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 11.4457 | 628564 | 4.8% |
| 11.51465 | 620294 | 4.7% |
| 11.37675 | 618965 | 4.7% |
| 11.5836 | 608709 | 4.6% |
| 11.23885 | 586999 | 4.4% |
| 11.7215 | 570816 | 4.3% |
| 11.79045 | 550803 | 4.2% |
| 11.1699 | 549851 | 4.2% |
| 11.8594 | 498123 | 3.8% |
| 11.10095 | 495994 | 3.8% |
| Other values (208) | 7484523 |
| Value | Count | Frequency (%) |
| 0 | 270 | < 0.1% |
| 0.06895 | 22591 | |
| 0.1379 | 3506 | < 0.1% |
| 0.20685 | 613 | < 0.1% |
| 0.2758 | 303 | < 0.1% |
| 0.34475 | 304 | < 0.1% |
| 0.4137 | 257 | < 0.1% |
| 0.48265 | 584 | < 0.1% |
| 0.5516 | 921 | < 0.1% |
| 0.62055 | 545 | < 0.1% |
| Value | Count | Frequency (%) |
| 14.96215 | 26277 | |
| 14.8932 | 24 | < 0.1% |
| 14.82425 | 33 | < 0.1% |
| 14.7553 | 31 | < 0.1% |
| 14.68635 | 41 | < 0.1% |
| 14.6174 | 57 | < 0.1% |
| 14.54845 | 64 | < 0.1% |
| 14.4795 | 63 | < 0.1% |
| 14.41055 | 46 | < 0.1% |
| 14.3416 | 32 | < 0.1% |
| Distinct | 138 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.01091301784 |
| Minimum | -12.5 |
|---|---|
| Maximum | 13 |
| Zeros | 3062406 |
| Zeros (%) | 23.2% |
| Negative | 5504677 |
| Negative (%) | 41.7% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | -12.5 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -0.2 |
| median | 0 |
| Q3 | 0.2 |
| 95-th percentile | 0.8 |
| Maximum | 13 |
| Range | 25.5 |
| Interquartile range (IQR) | 0.4 |
Descriptive statistics
| Standard deviation | 0.7855852151 |
|---|---|
| Coefficient of variation (CV) | -71.98606531 |
| Kurtosis | 156.3525718 |
| Mean | -0.01091301784 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 9.368671831 |
| Sum | -144200.7 |
| Variance | 0.6171441302 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3062406 | |
| -0.1 | 1277113 | |
| -0.2 | 1109964 | 8.4% |
| 0.1 | 1072153 | 8.1% |
| 0.2 | 814254 | 6.2% |
| -0.3 | 810250 | 6.1% |
| 0.3 | 619201 | 4.7% |
| -0.4 | 565264 | 4.3% |
| 0.4 | 453130 | 3.4% |
| 0.5 | 365707 | 2.8% |
| Other values (128) | 3064199 |
| Value | Count | Frequency (%) |
| -12.5 | 1 | |
| -9.9 | 1 | |
| -8.9 | 1 | |
| -8 | 2 | |
| -7.6 | 1 | |
| -7.3 | 1 | |
| -7.2 | 1 | |
| -7.1 | 1 | |
| -6.8 | 2 | |
| -6.7 | 2 |
| Value | Count | Frequency (%) |
| 13 | 22204 | |
| 12.9 | 5758 | < 0.1% |
| 7.5 | 1 | < 0.1% |
| 6.5 | 1 | < 0.1% |
| 6.2 | 1 | < 0.1% |
| 6.1 | 3 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5.7 | 1 | < 0.1% |
| 5.6 | 1 | < 0.1% |
| 5.5 | 3 | < 0.1% |
| Distinct | 11302 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1073.719842 |
| Minimum | 0 |
|---|---|
| Maximum | 8191.875 |
| Zeros | 226548 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 590.75 |
| Q1 | 897.25 |
| median | 1158.75 |
| Q3 | 1288.625 |
| 95-th percentile | 1463.875 |
| Maximum | 8191.875 |
| Range | 8191.875 |
| Interquartile range (IQR) | 391.375 |
Descriptive statistics
| Standard deviation | 321.1479959 |
|---|---|
| Coefficient of variation (CV) | 0.2990985015 |
| Kurtosis | 3.159375967 |
| Mean | 1073.719842 |
| Median Absolute Deviation (MAD) | 158.5 |
| Skewness | -0.7529860281 |
| Sum | 1.418774853 × 1010 |
| Variance | 103136.0353 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 226548 | 1.7% |
| 600.25 | 24214 | 0.2% |
| 599.75 | 24160 | 0.2% |
| 600 | 24069 | 0.2% |
| 600.5 | 23981 | 0.2% |
| 599.5 | 23669 | 0.2% |
| 599.25 | 23294 | 0.2% |
| 599 | 23128 | 0.2% |
| 600.875 | 23062 | 0.2% |
| 601.125 | 22695 | 0.2% |
| Other values (11292) | 12774821 |
| Value | Count | Frequency (%) |
| 0 | 226548 | |
| 13.875 | 1 | < 0.1% |
| 14.625 | 1 | < 0.1% |
| 17.625 | 1 | < 0.1% |
| 18.25 | 1 | < 0.1% |
| 19.125 | 1 | < 0.1% |
| 19.625 | 1 | < 0.1% |
| 21.625 | 1 | < 0.1% |
| 24.375 | 1 | < 0.1% |
| 24.625 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8191.875 | 135 | |
| 8031.875 | 1 | < 0.1% |
| 2296.25 | 1 | < 0.1% |
| 2267 | 1 | < 0.1% |
| 2146.875 | 1 | < 0.1% |
| 2144.5 | 1 | < 0.1% |
| 2141.375 | 1 | < 0.1% |
| 2138.625 | 1 | < 0.1% |
| 2137.25 | 2 | < 0.1% |
| 2136.75 | 1 | < 0.1% |
| Distinct | 1103 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.65940458 |
| Minimum | 0 |
|---|---|
| Maximum | 3876.198645 |
| Zeros | 3065605 |
| Zeros (%) | 23.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.301234 |
| median | 8.043992 |
| Q3 | 21.88439 |
| 95-th percentile | 48.382246 |
| Maximum | 3876.198645 |
| Range | 3876.198645 |
| Interquartile range (IQR) | 20.583156 |
Descriptive statistics
| Standard deviation | 79.42546363 |
|---|---|
| Coefficient of variation (CV) | 5.072061534 |
| Kurtosis | 2269.129089 |
| Mean | 15.65940458 |
| Median Absolute Deviation (MAD) | 8.043992 |
| Skewness | 46.75265747 |
| Sum | 206917750.4 |
| Variance | 6308.404273 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3065605 | 23.2% |
| 3.962849 | 90127 | 0.7% |
| 4.021996 | 88371 | 0.7% |
| 3.903702 | 86657 | 0.7% |
| 4.081143 | 85972 | 0.7% |
| 4.14029 | 80986 | 0.6% |
| 3.844555 | 79938 | 0.6% |
| 4.199437 | 76180 | 0.6% |
| 3.785408 | 71163 | 0.5% |
| 4.258584 | 70248 | 0.5% |
| Other values (1093) | 9418394 |
| Value | Count | Frequency (%) |
| 0 | 3065605 | |
| 0.059147 | 12406 | 0.1% |
| 0.118294 | 12098 | 0.1% |
| 0.177441 | 14606 | 0.1% |
| 0.236588 | 18361 | 0.1% |
| 0.295735 | 16841 | 0.1% |
| 0.354882 | 15150 | 0.1% |
| 0.414029 | 14139 | 0.1% |
| 0.473176 | 12138 | 0.1% |
| 0.532323 | 10374 | 0.1% |
| Value | Count | Frequency (%) |
| 3876.198645 | 5379 | |
| 65.120847 | 1 | < 0.1% |
| 65.0617 | 28 | < 0.1% |
| 65.002553 | 35 | < 0.1% |
| 64.943406 | 59 | < 0.1% |
| 64.884259 | 81 | < 0.1% |
| 64.825112 | 51 | < 0.1% |
| 64.765965 | 62 | < 0.1% |
| 64.706818 | 85 | < 0.1% |
| 64.647671 | 66 | < 0.1% |
| Distinct | 201 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.28595491 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 3079422 |
| Zeros (%) | 23.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3.5 |
| median | 26 |
| Q3 | 46 |
| 95-th percentile | 93.5 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 42.5 |
Descriptive statistics
| Standard deviation | 28.20459248 |
|---|---|
| Coefficient of variation (CV) | 0.901509721 |
| Kurtosis | -0.07494185937 |
| Mean | 31.28595491 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 0.8332060844 |
| Sum | 413401376.5 |
| Variance | 795.4990369 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 3079422 | 23.3% |
| 100 | 456077 | 3.5% |
| 23.5 | 180986 | 1.4% |
| 23 | 176421 | 1.3% |
| 24 | 172326 | 1.3% |
| 22.5 | 165340 | 1.3% |
| 24.5 | 163258 | 1.2% |
| 25 | 152021 | 1.2% |
| 22 | 147135 | 1.1% |
| 25.5 | 143977 | 1.1% |
| Other values (191) | 8376678 |
| Value | Count | Frequency (%) |
| 0 | 3079422 | |
| 0.5 | 52303 | 0.4% |
| 1 | 40498 | 0.3% |
| 1.5 | 31152 | 0.2% |
| 2 | 29473 | 0.2% |
| 2.5 | 26473 | 0.2% |
| 3 | 28233 | 0.2% |
| 3.5 | 26454 | 0.2% |
| 4 | 29420 | 0.2% |
| 4.5 | 27307 | 0.2% |
| Value | Count | Frequency (%) |
| 100 | 456077 | |
| 99.5 | 15915 | 0.1% |
| 99 | 15577 | 0.1% |
| 98.5 | 18060 | 0.1% |
| 98 | 16468 | 0.1% |
| 97.5 | 15875 | 0.1% |
| 97 | 15130 | 0.1% |
| 96.5 | 14932 | 0.1% |
| 96 | 15309 | 0.1% |
| 95.5 | 14706 | 0.1% |
| Distinct | 212 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2487299203 |
| Minimum | 0 |
|---|---|
| Maximum | 1.818398 |
| Zeros | 550952 |
| Zeros (%) | 4.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.008618 |
| Q1 | 0.060326 |
| median | 0.137888 |
| Q3 | 0.336102 |
| 95-th percentile | 0.870418 |
| Maximum | 1.818398 |
| Range | 1.818398 |
| Interquartile range (IQR) | 0.275776 |
Descriptive statistics
| Standard deviation | 0.2841405335 |
|---|---|
| Coefficient of variation (CV) | 1.142365716 |
| Kurtosis | 3.765779068 |
| Mean | 0.2487299203 |
| Median Absolute Deviation (MAD) | 0.112034 |
| Skewness | 1.919979745 |
| Sum | 3286627.873 |
| Variance | 0.08073584277 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.017236 | 1126500 | 8.5% |
| 0 | 550952 | 4.2% |
| 0.025854 | 520344 | 3.9% |
| 0.103416 | 503859 | 3.8% |
| 0.094798 | 484109 | 3.7% |
| 0.112034 | 465300 | 3.5% |
| 0.08618 | 399572 | 3.0% |
| 0.120652 | 398964 | 3.0% |
| 0.12927 | 329619 | 2.5% |
| 0.034472 | 316644 | 2.4% |
| Other values (202) | 8117778 |
| Value | Count | Frequency (%) |
| 0 | 550952 | |
| 0.008618 | 220245 | 1.7% |
| 0.017236 | 1126500 | |
| 0.025854 | 520344 | |
| 0.034472 | 316644 | 2.4% |
| 0.04309 | 244221 | 1.8% |
| 0.051708 | 216459 | 1.6% |
| 0.060326 | 211282 | 1.6% |
| 0.068944 | 237335 | 1.8% |
| 0.077562 | 299869 | 2.3% |
| Value | Count | Frequency (%) |
| 1.818398 | 4 | |
| 1.80978 | 3 | < 0.1% |
| 1.801162 | 3 | < 0.1% |
| 1.792544 | 2 | < 0.1% |
| 1.783926 | 2 | < 0.1% |
| 1.775308 | 2 | < 0.1% |
| 1.76669 | 1 | < 0.1% |
| 1.758072 | 5 | |
| 1.749454 | 9 | |
| 1.740836 | 3 | < 0.1% |
| Distinct | 104 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 125.7003492 |
| Minimum | 34 |
|---|---|
| Maximum | 510 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 34 |
|---|---|
| 5-th percentile | 102 |
| Q1 | 106 |
| median | 114 |
| Q3 | 134 |
| 95-th percentile | 188 |
| Maximum | 510 |
| Range | 476 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 28.449274 |
|---|---|
| Coefficient of variation (CV) | 0.2263261334 |
| Kurtosis | 4.069696944 |
| Mean | 125.7003492 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.932728418 |
| Sum | 1660959288 |
| Variance | 809.361191 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 102 | 1280254 | 9.7% |
| 104 | 1037794 | 7.9% |
| 112 | 1036325 | 7.8% |
| 110 | 1010191 | 7.6% |
| 114 | 705172 | 5.3% |
| 108 | 647195 | 4.9% |
| 106 | 579224 | 4.4% |
| 116 | 574572 | 4.3% |
| 118 | 430740 | 3.3% |
| 120 | 374896 | 2.8% |
| Other values (94) | 5537278 |
| Value | Count | Frequency (%) |
| 34 | 4 | < 0.1% |
| 50 | 1 | < 0.1% |
| 52 | 4 | < 0.1% |
| 68 | 1 | < 0.1% |
| 84 | 12 | < 0.1% |
| 86 | 10 | < 0.1% |
| 94 | 211 | < 0.1% |
| 96 | 13110 | 0.1% |
| 98 | 70622 | 0.5% |
| 100 | 350127 |
| Value | Count | Frequency (%) |
| 510 | 133 | |
| 508 | 2 | < 0.1% |
| 284 | 4 | < 0.1% |
| 282 | 6 | < 0.1% |
| 280 | 5 | < 0.1% |
| 278 | 5 | < 0.1% |
| 276 | 14 | < 0.1% |
| 274 | 5 | < 0.1% |
| 272 | 17 | < 0.1% |
| 270 | 24 | < 0.1% |
| Distinct | 251 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.65258012 |
| Minimum | 0 |
|---|---|
| Maximum | 100 |
| Zeros | 5299357 |
| Zeros (%) | 40.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 40.4 |
| Q3 | 67.6 |
| 95-th percentile | 100 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 67.6 |
Descriptive statistics
| Standard deviation | 35.72304961 |
|---|---|
| Coefficient of variation (CV) | 0.9487543616 |
| Kurtosis | -1.416464228 |
| Mean | 37.65258012 |
| Median Absolute Deviation (MAD) | 40.4 |
| Skewness | 0.2449926402 |
| Sum | 497527676.4 |
| Variance | 1276.136274 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 5299357 | |
| 100 | 809381 | 6.1% |
| 61.6 | 61356 | 0.5% |
| 64.8 | 61114 | 0.5% |
| 62.4 | 61097 | 0.5% |
| 60.8 | 61013 | 0.5% |
| 59.6 | 60209 | 0.5% |
| 62 | 60110 | 0.5% |
| 64 | 60081 | 0.5% |
| 61.2 | 60017 | 0.5% |
| Other values (241) | 6619906 |
| Value | Count | Frequency (%) |
| 0 | 5299357 | |
| 0.4 | 5214 | < 0.1% |
| 0.8 | 5531 | < 0.1% |
| 1.2 | 5537 | < 0.1% |
| 1.6 | 5688 | < 0.1% |
| 2 | 5493 | < 0.1% |
| 2.4 | 6151 | < 0.1% |
| 2.8 | 5830 | < 0.1% |
| 3.2 | 6063 | < 0.1% |
| 3.6 | 6012 | < 0.1% |
| Value | Count | Frequency (%) |
| 100 | 809381 | |
| 99.6 | 15577 | 0.1% |
| 99.2 | 16307 | 0.1% |
| 98.8 | 16024 | 0.1% |
| 98.4 | 15586 | 0.1% |
| 98 | 16315 | 0.1% |
| 97.6 | 17068 | 0.1% |
| 97.2 | 16498 | 0.1% |
| 96.8 | 16606 | 0.1% |
| 96.4 | 17084 | 0.1% |
| Distinct | 1055 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.54432759 |
| Minimum | 0 |
|---|---|
| Maximum | 255.97971 |
| Zeros | 1889729 |
| Zeros (%) | 14.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 16.498944 |
| median | 39.497472 |
| Q3 | 57.394764 |
| 95-th percentile | 75.89358 |
| Maximum | 255.97971 |
| Range | 255.97971 |
| Interquartile range (IQR) | 40.89582 |
Descriptive statistics
| Standard deviation | 24.93111938 |
|---|---|
| Coefficient of variation (CV) | 0.6640449033 |
| Kurtosis | -0.8299271261 |
| Mean | 37.54432759 |
| Median Absolute Deviation (MAD) | 20.397132 |
| Skewness | 0.001613865114 |
| Sum | 496097266.3 |
| Variance | 621.5607136 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 1889729 | 14.3% |
| 48.895308 | 23181 | 0.2% |
| 48.39534 | 22685 | 0.2% |
| 48.094578 | 22680 | 0.2% |
| 45.696294 | 22556 | 0.2% |
| 47.793816 | 22357 | 0.2% |
| 47.094642 | 22166 | 0.2% |
| 47.59461 | 21898 | 0.2% |
| 47.395404 | 21840 | 0.2% |
| 46.094706 | 21807 | 0.2% |
| Other values (1045) | 11122742 |
| Value | Count | Frequency (%) |
| 0 | 1889729 | |
| 0.999936 | 5199 | < 0.1% |
| 1.097586 | 4413 | < 0.1% |
| 1.199142 | 4848 | < 0.1% |
| 1.296792 | 6383 | < 0.1% |
| 1.398348 | 5544 | < 0.1% |
| 1.499904 | 5700 | < 0.1% |
| 1.597554 | 9266 | 0.1% |
| 1.69911 | 5943 | < 0.1% |
| 1.79676 | 5894 | < 0.1% |
| Value | Count | Frequency (%) |
| 255.97971 | 135 | < 0.1% |
| 255.975804 | 491 | |
| 115.492608 | 1 | < 0.1% |
| 115.289496 | 2 | < 0.1% |
| 115.191846 | 2 | < 0.1% |
| 114.891084 | 2 | < 0.1% |
| 114.789528 | 3 | < 0.1% |
| 114.590322 | 1 | < 0.1% |
| 114.492672 | 1 | < 0.1% |
| 114.28956 | 1 | < 0.1% |
| Distinct | 237 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.268935368 |
| Minimum | 0 |
|---|---|
| Maximum | 96.8 |
| Zeros | 10745857 |
| Zeros (%) | 81.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 100.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 21.6 |
| Maximum | 96.8 |
| Range | 96.8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 7.493586926 |
|---|---|
| Coefficient of variation (CV) | 2.292363135 |
| Kurtosis | 4.201458002 |
| Mean | 3.268935368 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.211387693 |
| Sum | 43194538.4 |
| Variance | 56.15384502 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0 | 10745857 | |
| 16 | 118023 | 0.9% |
| 15.2 | 111147 | 0.8% |
| 16.4 | 110151 | 0.8% |
| 16.8 | 102150 | 0.8% |
| 15.6 | 100220 | 0.8% |
| 14.8 | 86153 | 0.7% |
| 17.2 | 70887 | 0.5% |
| 14.4 | 61652 | 0.5% |
| 18 | 58330 | 0.4% |
| Other values (227) | 1649071 | 12.5% |
| Value | Count | Frequency (%) |
| 0 | 10745857 | |
| 0.4 | 44160 | 0.3% |
| 0.8 | 16989 | 0.1% |
| 1.2 | 12283 | 0.1% |
| 1.6 | 11018 | 0.1% |
| 2 | 11368 | 0.1% |
| 2.4 | 10285 | 0.1% |
| 2.8 | 9691 | 0.1% |
| 3.2 | 10309 | 0.1% |
| 3.6 | 7897 | 0.1% |
| Value | Count | Frequency (%) |
| 96.8 | 18 | < 0.1% |
| 96.4 | 49 | |
| 96 | 1 | < 0.1% |
| 95.6 | 2 | < 0.1% |
| 95.2 | 2 | < 0.1% |
| 94.8 | 3 | < 0.1% |
| 94.4 | 1 | < 0.1% |
| 93.6 | 3 | < 0.1% |
| 92.8 | 2 | < 0.1% |
| 92.4 | 2 | < 0.1% |
Auto
The auto setting is an easily interpretable pairwise column metric of the following mapping: vartype-vartype : method, categorical-categorical : Cramer's V, numerical-categorical : Cramer's V (using a discretized numerical column), numerical-numerical : Spearman's ρ. This configuration uses the best suitable for each pair of columns.Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1.717422e+10 | 11.58360 | 0.2 | 1103.750 | 15.496514 | 49.0 | 0.120652 | 112.0 | 69.2 | 19.498752 | 0.0 |
| 1 | 1.717422e+10 | 11.58360 | 0.1 | 1194.500 | 21.706949 | 42.0 | 0.232686 | 124.0 | 63.2 | 21.596274 | 0.0 |
| 2 | 1.717422e+10 | 11.58360 | -0.6 | 1183.000 | 7.215934 | 14.5 | 0.189596 | 120.0 | 40.8 | 21.697830 | 0.0 |
| 3 | 1.717422e+10 | 11.51465 | 0.0 | 1156.500 | 3.726261 | 7.5 | 0.155124 | 114.0 | 30.8 | 21.397068 | 0.0 |
| 4 | 1.717423e+10 | 11.51465 | -0.8 | 1020.250 | 0.000000 | 0.0 | 0.112034 | 112.0 | 0.0 | 19.498752 | 10.4 |
| 5 | 1.717423e+10 | 11.44570 | -1.0 | 911.125 | 0.000000 | 0.0 | 0.094798 | 110.0 | 0.0 | 16.596594 | 16.8 |
| 6 | 1.717423e+10 | 11.44570 | -0.2 | 615.250 | 0.000000 | 0.0 | 0.068944 | 106.0 | 0.0 | 12.096882 | 0.0 |
| 7 | 1.717423e+10 | 11.44570 | 0.0 | 968.750 | 14.491015 | 40.5 | 0.034472 | 104.0 | 46.8 | 11.796120 | 0.0 |
| 8 | 1.717423e+10 | 11.37675 | 0.5 | 1115.125 | 13.781251 | 31.5 | 0.051708 | 106.0 | 52.4 | 12.499200 | 0.0 |
| 9 | 1.717423e+10 | 11.37675 | 0.7 | 1247.250 | 12.361723 | 26.0 | 0.086180 | 108.0 | 55.2 | 13.698342 | 0.0 |
Last rows
| Timestamp | WetTankAirPressure | LongitudAcc | EngineSpeed | Fuel Rate | Engine Load | Boost Pressure | EngineAirInletPressure | AcceleratorPedalPos | VehicleSpeed | BrakePedalPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 13213631 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213632 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213633 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213634 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213635 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213636 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213637 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213638 | 8.030538e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213639 | 8.030539e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |
| 13213640 | 8.030539e+10 | 0.06895 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 102.0 | 0.0 | 0.0 | 0.0 |